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CORRECTION OF OVERLAY OFFSET BETWEEN INSPECTION LAYERS 

CI ATM OF PRIORITY 
This application is a Continuation-in-Part application which claims the 
10 benefit of priority to United States patent application no. 09/747,497 filed 
December 22, 2000, entitled "CORRECTION OF OVERLAY OFFSET 
BETWEEN INSPECTION LAYERS IN INTEGRATED CIRCUITS" which 
claims the benefit of priority to United States Provisional Application No. 
60/233,343, filed September 18, 2000, entitled "CORRECTION OF OVERLAY 
15 OFFSET BETWEEN INSPECTION LAYERS IN INTEGRATED CIRCUITS." 



FIELD OF THE INVENTION 
The present invention relates to the manufacturing of integrated circuits. 
More specifically, the present invention relates to identifying defects in integrated 
20 circuits being manufactured on a wafer. 



20. Prior Art 
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Integrated circuits are commonly manufactured in batches on wafers. 
Multiple integrated circuits are manufactured on a single silicon wafer during the 
manufacturing process. Referring now to FIG. 1, there is shown a silicon wafer 
having a plurality of integrated circuits disposed thereon referenced by numbers 
5 12, 14, 16, and 18. 



During the manufacturing process, multiple masking processes are 
performed on the semiconductor wafer. Each masking process defines where 
features on various layers that make up the integrated circuit are to be positioned. 
*jj 10 For example, a layer of poly crystalline silicon may be deposited on a wafer. Then 
m photosensitive resist is coated on the wafer and selectively exposed to light so that 
± after developing the resist, the remaining resist forms a pattern. This pattern is 
I* then transferred to the polysilicon during an etch step so that after the remaining 
sEJ resist is removed, the polycrystalline silicon forms a pattern defined by the 
*f 15 selective light exposure. 

This masking and etching process sequence is repeatedly carried out on 
each wafer to create the intricate interconnected patterns of semiconductors, 
insulators, and metals needed to create the desired integrated circuit (IC). 

20 

The process described above is carried out to produce a plurality of IC's on 
a wafer as shown in FIG. 1. For example, a single silicon wafer may be the basis 
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for tens to thousands of IC's. The ability to manufacture a plurality of IC's on a 
single silicon wafer reduces the overall cost of production, thereby passing these 
cost savings onto the consumer in the form of inexpensive IC's. 

With advancements in technology, IC's have become very small and very 
complex. With each generation, IC features become smaller because of 
advancements in the manufacturing process described above. However, with these 
advancements it has become more difficult to detect defects. 



As part of the manufacturing process, in an effort to reduce the number of 
defective IC dice, it is common to inspect all of, or a sample of, the dice on a 
sampling of the wafers being produced. The inspection may be an optical 
inspection with very sensitive optical instruments capable of detecting defects of 
the size of a minimum feature of the IC or might be an electrical test that, in the 
case of memories, is capable of locating the position of an electrical defect to 
within the area of one small cell. The artifacts that are detected serve to guide the 
engineers to where defects, which may lead to yield loss, occur. 

Referring now to FIG. 1 there is shown a wafer having four dice labeled 12, 
14, 16, and 18. The labeled dice illustrate how a wafer may be inspected during 
manufacturing. For example, each of these dice, either picked randomly or 
according to some reason, would be inspected for defects. 
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Referring now to FIG. 2, there is shown a hypothetical collection of defects 
that might be observed in an inspection report generated from inspecting the dice 
shown in FIG. 1. A line defect is illustrated on die 12 in FIG. 2. A line defect 
may be caused by a scratch on the wafer. Also shown on dice 12, 14, and 16 in 
FIG. 2, are a plurality of "point" defects22, 26, 28, 34, 36, and 38. Also illustrated 
on die 1 1 in FIG. 2 is a large continuous defect 30 and a cluster of point defects 
32. 

Referring now to prior art FIG. 3, there are shown defects 42 through 68 
that might be observed on another inspection report of the same dice later in the 
manufacturing process. The outline of the die is shown for illustrative purposes 
only in FIGS 2 and 3 and would not be generally available in a standard defect 
inspection report. 

Defects that occur on one layer may propagate through and also appear on 
subsequent levels during the manufacturing and inspection process. For example, 
a large contaminating particle that remains after cleaning of the polysilicon layer, 
might penetrate through the intervening dielectric layer and be seen on the metal 
contact layer. It is important to identify these propagating defects so that the cause 
of the defects can be correctly assigned. 
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After locating the defects, an overlay report may be generated in which the 
defects from the inspections of the successive layers are displayed together with 
the intention of identifying those defects that are reported to be at the same 
location on inspection of successive layers. However, experience has shown that 
5 there is an offset in the reported origin of the inspections between layers. 

Referring now to FIG. 4, there is shown an exemplary overlay report as 
would be created after performing at least two inspections. As shown in FIG. 4, 
an offset in the origins of the coordinate systems used to report the location of the 

j O 10 defects may cause the defects from one layer to be incorrectly positioned with 
respect to those defects of a second layer. This offset has been a recurring 

4E problem in the manufacture of integrated circuits and, as the size of the integrated 

» . circuits features are reduced and the number of dice on a wafer increases, there is a 

13 

Q growing need to correct this problem. 

p 15 

Thus, when an overlay inspection report is generated as shown in FIG. 4, 
propagating defects may not be identified. As shown in FIG. 4, only individual 
point defects from the defect clusters 32 and 62 would be selected. An attempt to 
identify correlated defects as those lying within some critical distance of each 
20 other would mistakenly select the pair of defects 48, both of which come from the 
second inspection. A more sophisticated correlation algorithm based on defects 
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from successive inspections that lie within a critical distance would misidentify 
defects 45 and 26 as being correlated when in fact they are not. 

Therefore, there is a need for a procedure for automatically detecting the 
origin offset between the inspections and correcting the offset in the overlay report 
so that the correlated defects that propagate between layers can be identified 
correctly. The present invention provides a method and apparatus for correcting 
these. 

SUMMARY OF THE INVENTION 
The present invention is directed towards a method for determining the 
offset between at least two origins of a coordinate system used for at least two 
different defect inspection spaces. The method comprises: collecting multiple sets 
of data spanning defect inspection spaces; filtering the data sets to remove points 
that introduce noise into correlation calculations; determining whether different 
data sets show correlation; selecting pairs of data sets showing correlation greater 
than or equal to a metric; and calculating coordinate offsets of the at least two 
origins based on the said selected pairs of said data sets. 

The present invention is also directed towards a method for determining the 
offset between at least three origins of a coordinate system used for at least three 
different defect inspections of a wafer with integrated circuits disposed on it. An 
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embodiment of the method comprises: finding all possible pairwise links between 
layers; constructing a tree of links; identifying all indirect paths along which layers 
can be linked; calculating statistics of offsets between indirectly linked layers; 
determining whether any pair of layers are linked by multiple paths; listing each 
pair of layers linked by multiple paths, if there are any pair of layers linked by 
multiple paths; selecting a listed pair of layers that have not been previously 
selected; determining whether offsets associated with said listed pair of layers are 
within confidence limits of each other; selecting the best estimate of said offsets; 
determining whether the system has selected all the of the listed pairs; and 
selecting a listed pair of layers that have not been previously selected. 

The invention further relates to machine readable media on which are 
stored embodiments of the present invention. It is contemplated that any media 
suitable for retrieving instructions is within the scope of the present invention. By 
way of example, such media may take the form of magnetic, optical, or 
semiconductor media. The invention also relates to data structures that contain 
embodiments of the present invention, and to the transmission of data structures 
containing embodiments of the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is an illustration of a prior art wafer showing the division into an 
array of dice. 
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FIG. 2 is an exemplary illustration of defects that may be detected during an 
inspection process. 

FIG. 3 is an illustration of defects that may be detected after a second layer 
has been processed on a wafer. 

FIG. 4 is an illustration of an overlay report that may be created after at 
least two inspections showing defects as identified on different layers. 

FIG. 5 is an illustration of the distributed offsets in the x-direction observed 
between defect pairs observed on two layers. 

FIG. 6 is an illustration of the distributed offsets in the y-direction observed 
between defect pairs observed on two layers. 

FIG. 7 is a function flow diagram illustrating the implementation of one 
method of the present invention. 

FIG. 8 is a functional flow diagram illustrating a second automated method 
of the present invention. 

FIG. 9 is a flow diagram illustrating the implementation of one method of 
the present invention. 

FIG. 10 is a flow diagram illustrating the implementation of one method of 
the present invention. 

FIG. 1 1 is a schematic illustration of a portion of a memory array. 

FIG. 12 is a schematic illustration of overlay offsets that have been 
obtained for various layers. 
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FIG. 13 is graphical illustration of the measured distribution of x coordinate 
offsets between various layers. 

FIG. 14 is a flow diagram illustrating the implementation of one method of 
the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
Those of ordinary skill in the art will realize that the following description 
of the present invention is illustrative only and not in any way limiting. Other 
embodiments of the invention will readily suggest themselves to such skilled 
persons. 

In describing the present invention it is assumed that a wafer has been 
selected for which in-line inspection of data is available after the operations on 
two or more layers. In these descriptions, the problem of finding the offsets in the 
origins between the inspection data on two of the layers is described. From this 
disclosure it will be obvious that this operation can be readily and repeatedly 
applied on wafers having two or more layers. Additionally, the method of the 
present invention can be applied to wafers having more than two sets of inspection 
data. 



Still further, the defect coordinates and the distance between the related 
defects are described in Cartesian coordinates. However, it will be obvious to one 
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having ordinary skill in the art that the distance between related defects could be 
described in polar coordinates or may be described using other coordinate systems. 
The Cartesian coordinate system is typically used by the machines utilized to 
perform the inspection process. Therefore the present invention will be described 
in Cartesian coordinates as it will be the most convenient system to utilize. 

Seven separate embodiments of the present invention are separately 
described herein. Referring now to FIG. 7, there is shown a functional flow 
diagram depicting a first method of the present invention. 

At box 710, the method of the present invention begins with the setting of 
maximum X and Y values (Xmax and Ymax), e.g., the maximum allowed offsets 
between defects that propagate between layers during the manufacturing process. 
The values chosen by the user for Xmax and Ymax may vary depending upon the 
equipment utilized during the inspection process. The operator having knowledge 
of the equipment will chose appropriate values for Xmax and Ymax. For 
example, the operator may know from experience that the offsets are usually less 
than 60 micrometers. 

At box 720, the minimum spacing allowed for intralayer defects Xint and 
Yint are established. The values for defects Xint and Yint are dependent upon the 
scale of technology being utilized in the manufacturing process. As an illustrative 
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example, in the case of a technology with the minimum feature size of 0.5 
micrometers, defects on the same inspection layer spaced closer than 10 
micrometers might be considered to result from clustering. 

In use, defects that are disposed closer together than the pre-set Xint and 
Yint values on the same inspection layer are excluded from the origin offset 
calculations. The minimum values may be selected by the user or they can be 
selected automatically by using the following formulas: 

Xint > 2*Xmax 

Yint > 2*Ymax 

The limits for Xint and Yint are established to avoid confusion that may 
occur when there are two or more defects on one layer that can be overlaid on 
another defect that will be observed on a subsequent layer. 

At box 730, a database is searched for all interlayer defect pairs consistent 
with the restrictions selected previously at box 710 and box 720. The process of 
20 box 730 may be performed after at least two inspections have been performed on 
the wafer. After locating all defect pairs that satisfy the restrictions previously 
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selected (Xmax, Ymax, Xmin, Ymin), the offsets AX and AY in the coordinates of 
these defect pairs are calculated and saved for later use. 

At Diamond 740 it is determined whether the offsets calculated in box 730 
are distributed randomly with some selected probability. For example, the 
distribution could be tested to determine if the probability of the observed 
distribution occurring as a result of random processes is less than 1%. There are a 
number of methods known to those skilled in the art of performing statistical 
calculations that may be applied to achieve the desired results. See for example, 
Maurice G. Kendell and Alan Stuart, "The Advanced Theory of Statistics", Vol. 2 
Inference and Relationship, Hafner Publishing Co., New York (1967). 

If it is determined that the distribution could have occurred randomly, then 
the process proceeds to box 790 where the process stops. Alternatively, if it is 
determined that the distribution could not have occurred randomly, then the 
process proceeds to box 750. 

At box 750 the "dense zones" for the coordinate differences are determined 
as a result of the process of box 730. By "dense zones" is meant a region on each 
of the coordinate offset axes where there is a relatively high density of offsets 
found. In FIG. 5 it is seen that there is a relatively high density of offsets 
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occurring in the range -8 fim to +8 /im. In FIG. 6 a similar cluster of offsets is 
observed between -42 jim and -28 /zm. 

The histograms of distances between pairs seen on a wafer for the x- 
coordinate and y-coordinate are shown in FIGS. 5 and 6, respectively. Referring 
now to FIG. 5, there is shown an exemplary histogram that may be generated 
during an inspection process undertaken on wafer 10. As shown in FIG. 5, there is 
a peak value 510 in the distribution of differences in the region between 0 and 
+2[im. 

Referring now to FIG. 6, there is shown an exemplary histogram that may 
be generated during an inspection process undertaken on wafer 100. As shown in 
FIG. 6, there is a peak value 610 in the distribution of differences in the region 
between -36jwm and -34 /xm. 

Each of the dense zones in FIGS. 5 and 6 have roughly a Gaussian 
distribution because the central limit theorem assures that the scatter in a number 
of observations that arise from several causes will tend to approximately a 
Gaussian distribution as the number of observations increases. 

Referring now to FIG. 7, specifically box 750, there are a number of 
possible heuristic algorithms for determining the "dense zones." For example, the 
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range of coordinate differences, Xmax and Ymax, could each be divided into a 
number of equal segments, e.g. 10 segments. The number of points falling into 
each segment are counted and the segments containing the greatest number of 
points falling within that segment for x-coordinate differences and for y- 
coordinate differences are selected as the most likely location of the dense zone. 

Alternatively, the program can search for the set of intervals for which the 
count of observations is maximal using the following equations: 

(AXupper - AXlower) < Xmax/10; and 

(AYupper - AYlower) < Ymax/10. 

In the equations given above, AXupper is the upper value and AXlower is 
the lower value, respectively, for a segment of an ordered array of the AX values. 
AYupper and AYlower, are respectively the upper and lower values for a segment 
of an ordered array of AY values. The two examples described above are merely 
exemplary in nature and should not be considered limiting. As one who is skilled 
in the art will appreciate, there are many methods that may be utilized to 
determine the "dense zones"; each of which can be utilized with the invention 
described herein. 
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At box 760, after the dense zones have been identified, the standard 
descriptive statistics, i.e., the average, standard deviation, the upper and lower 
confidence limits, and an interval size for the confidence interval are calculated. 
The averages are taken as the best estimate of the offsets of the origins between 
the two layers and the confidence limits indicate the range within which the "true" 
offset is expected to lie. In this embodiment, the average is utilized as the estimate 
of the "true" offset both because of its intuitive appeal and because of its 
computational simplicity. It may be obvious to those ordinarily skilled in the art 
that other metrics could be utilized to define the "true" offset. For example, the 
median may be utilized as the estimate of the offset rather than the average. 
Similarly, one could use a maximum likelihood approach to estimate the "true" 
offset from the observations in the dense zones. 

At box 770 the calculated offset and confidence limits are reported to the 
user so that they may be utilized to determine propagating dice and other defects 
on wafer 100. 

At box 780, the process ends and is reset for use on the next wafer being 
processed. 

Referring now to FIG. 8, there is shown a functional flow diagram 
depicting a second automated method according to the present invention. With 
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reference to FIG. 8, an automated process for determining offsets is described. 
The process is referred to herein as ^-restricted sampling. The A-restricted 
sampling is applied to avoid the confusion that occurs when a defect is observed 
on one layer that can be overlaid on either two different defects on another layer. 
The ^-restricted sampling of the present invention should not be confused with the 
^-statistics of R. A. Fisher. 

For example, the members of the cluster 32 and the members of the cluster 
62 seen in FIG. 4 have the potential to confuse a calculation of offset. In the 
method described above, the restriction on the minimum spacing between defects 
on the same layer is used to remove clusters 32 and 62 from the offset calculation. 
In the embodiment disclosed with reference to FIG. 7, the restriction on the 
minimum spacing between defects on the same layer is used to remove the 
clustered defects 32 and 62 from the calculation. In an automatic embodiment of 
the present invention, the operator intervention of choosing Xmax and Ymax is 
eliminated. 

Referring now to FIG. 8, at box 810, the values for Xmax and Ymax are 
taken to be the dimensions of the die under study. These values are extracted from 
the database. The problem of confusion of multiple defects is avoided by 
restricting the dice on which the calculations are earned out to those for which the 
number of defects, nd, satisfies the constraint 0< nd < k where k is a small integer. 

16 
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Typically k is less than or equal to 3. If k is equal to 1, then the chance of 
confusion is zero. If, for example, k is set equal to 2, then assuming that for a die 
nd-2, and that none of the defects are propagating, 2 or 4 random pairs of AX and 
AY are introduced into the list of pair differences, depending upon whether there 
5 are two defects on both layers or on only one of the two layers. If one of the 
defects is propagating, there will be one matched pair and 1 or 3 random pairs 
introduced into the list of different pairs. If there are two propagating pairs, two 
matched pairs and no random pairs will be entered onto the list of different pairs. 

This reasoning can be carried out on the k=n, but it becomes clear that the 
number of possibilities of random pairs increases faster than the possibilities of 
matched pairs so that the "noise" can overwhelm the "signal". For this reason, the 
value of k is chosen to be < 3, and is typically chosen to be 1 for a first try at 
determining the offset. 

At box 820, all defect pairs are identified on selected dice on two different 
layers disposed on the wafer. 

At box 830, AX and AY of the pairs are determined according the process 
20 described above. 

17 



ET527025778US 

Docket No: HEUR-009CIP 

At Diamond 840, the newly determined AX and AY pairs are tested to 
determine if they are distributed randomly. If the pairs are randomly distributed 
then the process advances to box 890 and the process stops. Alternatively, if it is 
determined that the pairs are not randomly distributed, there is an observed 
5 propagation of defects and the process advances to box 850. 

At box 850, the dense zones for both X values and Y values are calculated 
as described above. 

At box 860, after locating the dense zones, the process proceeds to calculate 
an estimate of the origin of the offset and its statistical uncertainty as described in 
the process above. 

At box 870, the offset and the statistical uncertainty are displayed to the 

user. 

At box 880, the process is ended and reset for use on the next wafer being 
processed. 

20 When the automatic method described above and depicted in FIG. 8 is 

employed, the sample size of the number of pairs for which differences are 
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calculated will be small in cases where there are few dice with a small number of 
defects. This may lead to more uncertainty in the estimate of the offset. 

Each of the methods described herein may be utilized independently, or 
alternatively may be utilized concurrently. For example, the same data may be 
utilized for both processes. The user can then choose the calculated offset that 
gives the best result or utilize the result of the automatic approach as an input for 
the semi-automatic approach. 

The problem of removing offsets between inspection data for different 
layers of a manufacturing process was discussed above. The example of layers 
given was the results of inspections performed after subsequent processing steps , 
e.g. inspections performed after, active area formation, poly definition and first 
layer metal definition. 

However, layers may have the more general meaning of the results of 
separate inspections of an object. In integrated circuit manufacturing technology, 
separate layers could also mean multiple inspections of a wafer at the same point 
in the manufacturing flow performed with differing techniques, e.g. grazing angle 
laser scattering, dark field microscope inspection, and UV light induced 
fluorescence inspection. Layers could also mean the results of the repeated 
inspection of the same wafers at the same position is the manufacturing flow with 
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the same tool as, for example is a test to determine the rate of particulate 
accumulation on a wafer. Similar examples could be found for other technologies. 

A generalized flow chart for the invention is shown in FIG. 9. FIGS. 7 and 
5 8 are non-limiting examples of the general method, shown in FIG., 9, applied in a 
specific manner. First, more than one sets of inspection data are collected which 
cover an inspection space as indicated in act 910. In the examples given 
previously with regards to FIGS. 7 and 8, the inspection space is a planar surface, 
e.g. all or a portion of the surface of a wafer or a flat panel display. However, the 
© 10 method is clearly applicable to data sets from inspection of other forms as long as 
HI the location of points in the space can be described in an orthogonal coordinate 

*p system. The most obvious extension may be to curved two dimensional surfaces. 

if % i 

J For example, there is discussion of making displays that will fit on curved objects. 

13 It would still be possible to inspect the emitting surface of these displays and this 

t Tv. 

15 inspection could be used to find the offsets between the origin of the coordinate 
systems used for multiple inspections. Further thought shows that the inspection 
spaces need not be limited to two-dimensional ones. For example, one could 
inspect the surface of a two-handled cup, which can not be mapped onto a simply 
connected two-dimensional surface, and use this invention to find the offsets 

20 between the coordinate systems for multiple inspections. 
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In general, it is useful to filter the data in one way or another in order to 
improve the "signal to noise level" between the features detected in one inspection 
which are correlated to features detected in another inspection as is shown in act 
912. 

An example of a filter is given in the implementation discussed previously 
in which features that were closer than a minimum distance on the same layer 
were discarded as were features on different levels that were located farther from 
each other than some maximum distance. 

If the number of features is sufficiently sparse, filtering is not necessary, 
but the number of pairs available for calculating the offsets may be so low that the 
spread in the confidence limits associated with the result are fairly large. One way 
of dealing with this is to divide the inspection space into a number of spaces so 
that the number of features in each subspace is small enough that filtering within 
the subspace is not necessary. The spread in the confidence limits can be reduced 
in this approach by combining the results from the many subspaces. This is the 
approach used in the second embodiment of this invention with the addition of the 
filter that the number of features in a subspace must be less than some maximum. 
It should be clear from this discussion that an alternative approach would be to 
further subdivide any regions with an excessive number of features into two or 
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more subregions such that the number in any one region meets the criterion that 
the number of features is sufficiently sparse. 

After filtering, the sets are compared to see if the correlation between them 
exceeds some preset limit as indicated by the query 914. Means of testing for 
correlation are discussed in the following embodiments. Other means of testing 
for correlation are known to those of ordinary skill in the statistics art. 

If the correlation does not exceed some preset minimum, stop the 
procedure. If the correlation does exceed this limit, the pairs showing correlation 
greater or equal to some predetermined metric are selected as is indicated in act 
916. Means for such selection is discussed in other embodiments. Other means 
for this selection is known to those of ordinary skill in the statistics art. 

As is indicated in act 918, the coordinate offsets are then calculated. Means 
for calculating these offsets and alternatives are discussed in the other 
embodiments herein. 

Once the offsets are calculated, the procedure ends. The results may be 
reported to a user or employed in a procedure to, for example, automatically 
remove the offsets between the coordinates of the data sets so that the features 
observed in the inspection are stored and/or displayed "overlaid" on one another. 
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Another embodiment of an algorithm for finding the offset between the 
coordinate systems for two sets of inspection data from the same wafer is 
illustrated in the flow chart shown in FIG. 10. The method shown in FIG. 10 is 
also a non-limiting example of one embodiment of the more general method 
shown in FIG. 9, used in a specific application. The algorithm begins by filtering 
the data from each inspection and removing all defect clusters, i.e. all groups of 
defects for which the defects lie within some predetermined minimum data from 
each other as is indicated with the act 1010. 

After filtering each inspection data set to remove clusters, one inspection 
set is defined as a first layer. This is normally the layer from which defects may 
propagate to the second layer. For each defect in the first layer, the region on the 
second layer within a rectangle of predetermined size centered on coordinate of 
the defect in the first layer is searched for defects as is indicated in act 1020. (It is 
often found convenient to choose the rectangle to be a square.) For each defect 
found, its offset from the center of the rectangle, Ax and Ay, is calculated as is 
indicated in act 1030. 

The array of offsets, which extend from -AX max to +A Xmax in the x- 
coordinate and from -AY max to +AY max in the y-coordinate, is search for the 
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rectangle of predetermined size h x , h y that contains the most points as is indicated 
in act 1040. This is defined as the "dense zone". 

The dense zone will contain some p points. The question of whether the 
offsets are distributed randomly is determined by testing whether the probability 
that at least p of the offset pairs will lie within a rectangle of size (h x x h y ) is less 
than some predetermined small value, e.g. 0.05 as is indicated in query 1050. If it 
is, the distribution of offsets is not random. Other wise the offsets are taken to be 
a random distribution and the procedure stops as act 1090 shows. 

If the distribution is not random, the descriptive statistics for the dense zone 
are calculated are calculated in act 1060 and reported in act 1070. After this, the 
algorithm terminates as is shown in 1080. 

An electrical failure map is different is several ways from the defect maps 
generated from in-line inspection during processing. On the one hand, the 
electrical failures resulting from one localized defect can extend for long distances 
on a die. In the extreme case of a defect that causes a short between power supply 
traces at different potentials, e.g. V DD and ground, the entire die may fail. On the 
other hand, the electrical failure map is in some ways the principal layer because 
the usual goal of all of the inspections is to improve the die yield, i.e. to minimize 
the electrical failures. 
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In many practical cases such as bit failures in memory arrays, the location 
of the electrical defect may be restricted to a smaller area than the defects 
observed with the in-line inspections. 

Finally, because the electrical test is performed after all of the layers used 
to form the integrated circuit are processed and, consequently, after all of the in- 
line inspections are performed, correlation may be possible between the electrical 
failure map and many or all of the in-line inspection maps. This is in contrast to 
the in-line inspection where a defect first observed at, for example, layer 3 will 
usually only propagate forward for a few layers and cannot propagate to layers 
previously inspected so that propagation is usually restricted to a few layers 
occurring near each other in the process sequence. 

It is known that only a portion of the defects that are observed in with the 
in-line inspections will cause electrical failure either because of their location or 
their size. Those defects that cause electrical failures are often called "killer 
defects" and distinguished from "non-killer defects". However, in principle, the 
non-killer defects are no different than the defects that do not propagate from one 
layer to the next and are handled by the existing procedures. 
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As was noted previously, local physical defects can cause extended 
electrical failures. The physical location of defects causing failures detected 
electrically is in general difficult to determine, however, in the special case of 
memory arrays this can often be determined with reasonable precision. FIG. 1 1 
shows schematically a portion of a memory array, 1 100. The memory cells are 
arrayed regularly along a number of rows and columns. The location of each cell 
is determined uniquely by specifying which row and column the cell lies on, i.e. 
the row and column addresses. In the cases that an isolated cell fails, such as is 
shown for cells 1 1 10, 1 1 12, and 1 1 14, the defect can be assumed to lie within the 
area covered by the cell. In many modern memory technologies, the area 
occupied by a cell is < 1 /mi 2 so that the location is determined with some 
precision. 

Unfortunately, not all failures involve only a single cell. Failures also 
appear as pairs of cells, shown as 1 1 18, complete or partial rows, illustrated in 
1 130, complete or partial columns, illustrated as 1 132, or other groups of cells. In 
the case of small, contiguous groups such as pairs, the defect can be assigned to 
the area covered by the cells included in the group. Other groups such as rows and 
columns, and clusters of closely spaced but discontinuous clumps of cells must be 
filtered out of the data before correlation studies are under taken as has been 
discussed for clusters of defects observed with in-line inspections. 
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In the very common special case in which a Cartesian coordinate system is 
employed, overlay information can be extracted from the row and column failures. 
Consider the row of bit failures in FIG. 1 1 labeled 1 130. This row of failing bits 
could be the result of a short between the row select line, usually called the "word 
5 line", and ground in the cell location filled in and labeled 1 120 in FIG. 1 1 . 

However, a short between the word line and ground in one of the other cells along 
row 1 130, indicated by crosses in FIG. 1 1 would also result in failure of all of the 
cell along the row. Therefore, the row failure gives no information as to the 
location of the underlying physical defect along the row, i.e. in the x direction. 
; fi 10 However, it is known the defect is located within a small range of y coordinates, 
ffi i.e. the height of the row label 1 130. Analogous arguments can be made with 
j; respect to the column labeled 1 132. 



Ij Advantage can be taken of the additional information contained in the 

© 15 location of the row and column failures if the problem of finding the x and y 

coordinates of the offsets between electrical failures and defects observed with in- 
line inspection is separated into two independent problems, one for each 
coordinate, at the time of the filtering of the failures. In this approach the y 
locations of failing rows or partial rows are included in determination of the 
20 overlay offsets in the y direction and the locations of failing columns or partial 
columns are included in determination of the overlay offsets in the x direction. 
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After the overlay offsets have been determined between the various in-line 
inspections and between the defects observed with in-line inspection and the 
electrical failures, it is often observed that not all of the offsets are independent of 
each other. 

Consider FIG. 12. In this figure, the overlay offsets that have been 
obtained between the in-line inspections of the various process layers 1 through n 
and between the in-line inspections and the electrical failures, layer "e", are shown 
schematically. For example, the line with arrows between layers 1 and 2 indicate 
that an best estimate of the overlay offset (i.e. the average offset) has been 
determined between these layers so that it is known that 

x l = x 2 + AX, 2 with some standard deviation s( AX, 2 ) eq. la) 

and yi = y 2 + A} u witn som e standard deviation s(A^ 2 ) eq. lb) 

where the best offset estimate and its uncertainties have been determined from the 
defects lying in the dense zone, which number N 12 . These offsets are determined 
by the equations: 




and 



eq. 2a) 



A^, 2 = 




eq. 2b) 



where Ax, 2 and Ay l2 are the individual offsets found in the dense zone. 
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The standard deviations are determined by the equations 




and 



eq. 3a) 




eq. 3b) 



DF 12 



where DF l 2 is the degrees of freedom of the average and equal to (N 12 -1). 

Similar statements can be made with respect to layer 2 and the electrical 
failure layer "e". The inspection results on layer "m" are shown to have a known 
offset to those on layers "p" and "n". Additionally, the offsets between the 
inspection results on layers "m" and "p" and the locations of the electrical failures 
on layer "e" are known. (In the case of the electrical bit maps for which row and 
column failures are used in the dense zone calculations, the number of elements in 
the dense zones for the x and y offsets may not be the same resulting in slightly 
different error statistics, as will be appreciated by one of ordinary skill in the art.) 

Overlay offsets may not be determined for all layers inspected because the 
procedure may fail as discussed above, as is shown in the case of layer "3". 
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In some cases, such as the in-line layers labeled group A in FIG. 12, the 
overlay offset between layers "1" and "e" can be determined via layer "2". By 
analogy with equations 1, one can write 

x 2 = x e + AX, e with some standard deviation s(AX 2e ) eq. 4a) 

and y 2 = y e + AY 2 e with some standard deviation s{AY 2 e ) . eq. 4b) 
Combining equations 1 and equations 2 results in 

x l = x e + AX le with some standard deviation s(AX e ,) eq. 5a) 

and y, = y e + AY U with some standard deviation s(AY ei ), eq. 5b) 



where the offsets are given by the equations 

&X le =AX u +&X l2 and eq. 6a) 

and AY l e =AY 2e +AY 12 eq. 6b) 

The uncertainties in the offsets between layers "e" and "1" will be greater 
than the uncertainty in either the offset between "2" and "e" or that between "1" 
and "2". In the case that the offsets between layers are independent of each other, 
one can write 



(DF, 2 + DF V ) 
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The number of degrees of freedom of the offset between layers e and 2 is 
the sum of those between layers e and 1 and layers 1 and 2. 

Those of ordinary skill in the statistical arts will know of other algorithms 
5 for calculation of the uncertainty of the estimate of the offset of the coordinate 
systems between layers e and 1 based on the estimates of the offset between layers 
e and 2 and the offset between layers 2 and 1 . 

It can be proven by induction that eqs. 6 can be extended to in the case of k 
10 steps between two layers to 

W X = X*< eq.8)) 

where the sum is a vector sum over the vector offsets. 

£ Similarly, eqs. 7 can be extended by induction for the case of k steps 

15 between two layers to 



s(A X\,k ) '■ 



AX..* 

k Z 



eq. 9) 



1=1 



where the number of degrees of freedom of the whole is the sum of the 
degrees of freedom of the k steps along the path. 
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In other cases there may be more correlation equations than there are 
variables so that the offsets are overdetermined. This occurs with respect to the 
in-line inspection layers labeled B in fig 12 and the electrical failure level "e". 

Following the previous discussion, layer n can be related to layer e with 
either of the following set of equations: 
AX = AX + AX 

n,e m,e m,n 

eqs. 10) 

AF =Ay +A7 

n,e m,e m,n 

or 

AX^AX^+AX^ 

eqs. 11) 

AY nie =AY p ^AY p>n 

Eqs. 7a and 7b suggest an answer as to which of the above sets of equations 
should be used. The standard deviation of each independent path is calculated. 
The path with the smaller uncertainty, e.g. the standard deviation or standard 
error, is chosen as the best estimate of the offset of the coordinate system of the 
inspection data for the layer n from the coordinate system of the electrical failure 
map, layer e. 
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In some cases, advantage can be taken of the extra information available on 
the correlations within group B and between members of group B and layer "e" to 
improve the estimates of the offsets. 

This advantage is illustrated in FIG. 13. The curve 1310 represents the 
measured distribution of x coordinate offsets between layers "e" and "m" while the 
curve 1 320 represents the measured distribution of x coordinate offsets between 
layers "e" and "p". From these distributions the offsets AX e , m and AX e>p are found. 
However, the distribution of the x offsets between layers "m" and "p" is also 
measured. In the illustration, it is assumed that the expected value of AX^p is less 
than the value of [AX e p - AX e m ]. This is not really contradictory because the 
difference is within the uncertainties in the determination of the three values. In 
fact, the information from the offset between layers m and p can be used to refine 
the estimates of the values of the offsets AX e m and AX e p . The improved estimates 
are shown by the dashed lines 1312 and 1322 in FIG. 13. 

Formally, assume that there two independent paths are found to estimate 
the offsets between layers f and g. Let us denote one path by U and the other path 

— > 

by V . Then 
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and 



eqs. 12) 



where the path £/ has n u steps and the path V has n v steps. 



The offset bX v _ f4l has DF y degrees of freedom and the path AX V f has 
DF V degrees of freedom that are given by the equations 

DF u =f,DF l 



1=1 

and 

DF v =f,DF t 



eqs. 13) 



The standard deviations associated with these components are 



S[D^x 5 (AX,) 2 ] 



1 



[=\ 



DF, 



IU 



and 



eqs. 14) 
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\1L[dF iv xs{AX v ) 2 



DF, 



IV 



1=1 



The best estimate of the offset between f and g is given by 

f ' S (DFu + nu) + (DF V + n v ) 

The standard deviation of this estimate is given by 

*(AZ ) = \ DF ^ Xs ( AX ^f.S + DF vX<^v.f, g ) 2 
1 U) 1 (DF U+ DF V ) 



eq. 15) 



eq. 16) 



An embodiment for finding the offsets between the coordinate systems of at 
least three sets of data representing the defects found in the inspections on three 
layers of an integrated circuit wafer is summarized schematically in the flow chart 
shown in FIG. 14. 



The method begins by finding all of the offsets that can be determined 
between pairs of layers by any of several alternative methods described in this 
disclosure as is shown in act 1410. For convenience, the offsets found between 
the coordinates on the various layers will be referred to as the links between the 
layers. Once the possible links between the pairs of layers are known, a tree of 
links similar to that illustrated in fig 1 1 is constructed as indicated in act 1412 of 
fig 14. 
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As has been discussed, if the links between layers a and b and between 
layers b and c are known, the link between layers a and c can be found. The link 
between two layers that is found as a result of links that these two have to other 
layers is referred to as an indirect link. Once the tree of direct links is established 
as indicated in act 1412, the possible indirect paths can be found by several 
methods including an exhaustive search, as is indicated in act 1414. Once the 
indirect links are identified, the statistics can be calculated using eqs. 6 through 9 
as is indicated in act 1416. 

As indicated in query 1418, the resulting links are now tested to ascertain if 
any layers are linked by multiple paths, i.e. by paths which provide links through 
different sets of layers. If not, the process terminates. If there are layers linked 
with multiple paths, the layers with these multiple paths are listed, as is shown in 
act 1422. 

A first pair with multiple paths is selected at act 1424 and tested to see if 
the coordinate offsets indicated by the alternative paths agree within some 
confidence limit at query 1428. If not, a warning is sent to the operator or 
engineer at act 1426. If there is agreement, a best estimate is selected at act 1430 
by either of the alternative methods discussed previously. This process is repeated 
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until all of the pairs linked by multiple paths are examined, query 1434 and acts 
1432 and 1436. 

Although the teachings of this invention have been described in terms of 
inspection in the manufacture of integrated circuits, it should be apparent to one of 
ordinary skill in the art that the teachings of this invention can be applied to other 
manufacturing operations in which inspections are performed on various layers 
during the manufacturing process and it is desired to remove offsets between the 
inspection data from various layers. Non-limiting examples of this might be in the 
manufacture of display panels or multilevel printed wiring boards. 

While embodiments and applications of this invention have been shown 
and described, it would be apparent to those skilled in the art that many more 
modifications than mentioned above are possible without departing from the 
inventive concepts herein. For example, although the present invention has been 
described in detail regarding the utilization of specific mathematical equations, it 
should be understood by one skilled in the art that there are a plurality of equations 
that may be utilized instead of those described herein. The invention, therefore, is 
not to be restricted except in the spirit of the appended claims. 
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